Water drives peptide conformational transitions 
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Transitions between metastable conformations of a dipeptide are investigated using classical 
molecular dynamics simulation with explicit water molecules. The distribution of the surrounding 
water at different moments before the transitions and the dynamical correlations of water with the 
peptide's configurational motions indicate that water is the main driving force of the conformational 
changes. 



Investigations of protein dynamics have recently led to 
believe that water plays the major role in protein mo- 
tion. There is a large body of experimental and sim- 
ulation evidences [1-4] showing a close connection be- 
tween the water dynamics and the protein conformations. 
Frauenfelder and colleagues have experimentally shown 
that protein dominant conformational motions are slaved 
by the hydration shell and the bulk solvent [5] , while the 
protein molecule itself provides an 'active matrix' nec- 
essary for guiding the water's dynamics towards biolog- 
ically relevant conformational changes. The change in 
water dynamics at the shell of up to almost a dozen wa- 
ter molecule diameters around proteins is found in [6] . 

Despite extensive research on protein dynamics the in- 
vestigations of elementary conformational motions are 
rare. Specific molecular mechanisms, including the in- 
volvement of water molecules, that drive the conforma- 
tional moves are highly demanded as they ultimately de- 
fine all rearrangements of proteins as a whole. 

In this work we analyse molecular dynamics (MD) 
simulated peptide focusing on the moments of ele- 
mentary conformational changes including explicit wa- 
ter molecules. We show that water indeed drives the 
changes, we elucidate the specific mechanisms of this phe- 
nomenon. 

We study a zwitterion L-alanyl-L-alanine, Fig. 1, a 
very convenient model because i) the conformation of 
the molecule is completely defined by the two dihedral 
angles ip and </>, ii) in water the conformation ip « 2.5, 
(j> « —2.2 radians is prevalent, however very rare transi- 
tions to two other metastable conformations take place, 
and in) the transitions only happen in water because of 
the molecule's charged ends. 

The three well separated metastable states, clearly vis- 
ible on the density of states, Fig. 1, allow to introduce a 
simple natural discretisation of the conformational states. 
By also discretising time with a step At the continuous 
MD trajectory can be converted into a string of symbols 
{si},i = 0...7V, where Si equals to L A\ L B\ or 'C de- 
pending on where the trajectory point falls at the time 
moment ti, N is the number of time steps. 




FIG. 1: Left: L-alanyl-L-alanine zwitterion and the nor- 
malised density of its conformations (Ramachandran plot) 
formed by a 1/xs trajectory; right: the same probabilities em- 
phasising the presence of two minor conformations and the 
partitioning for symbolisation 



An important problem is how to identify the moments 
of transition. The described discretisation of the confor- 
mational states defines boundaries of the states. How- 
ever, they delineate the density of conformations aver- 
aged over the whole trajectory. As we are interested in 
the dynamically metastable states, that is the configu- 
rations in which the trajectory spends significantly more 
time compared to the time it spends in transitions be- 
tween the configurations, a dynamical model capable of 
identifying the transitions is required. Also, the trajec- 
tory does not go directly from one state to another, in- 
stead it winds in a complicated manner, often crossing 
the states boundaries many times. 

One of the most popular description of protein dynam- 
ics lately is the Markov State Model (MSM) [7]. The 
model specifies the probabilities of each of the discrete 
states as well as the probabilities of the transitions be- 
tween them. The MSM transition matrix can be cal- 
culated from the MD trajectory by counting the state 
changes and for the studied molecule it is 

ABC 
A 0.997 0.002 0.001 
B 0.261 0.737 0.002 
C 0.097 0.001 0.901 
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where the value at row i and column j gives the proba- 
bility of going to state j being currently at state i. MSM 
provides many useful quantities describing the system [7] . 
In particular, it tells that the transition A — >> B happens 
once in 2.8ns on average. We will concentrate on this 
specific transition for the rest of the paper. 

The moments of the A — >> B transitions within the 
MSM framework are the values of U for which Si is equal 
to B while being equal to A at the previous time mo- 
ment t{-\. The time precision of identifying the transi- 
tions is At, which is insufficient for studying the transi- 
tions themselves. The model is valid only for relatively 
large time steps, that follows from the requirement for 
the transitions to be history independent (statistically 
uncorrelated). For this peptide the minimal valid time 
step is ~ 6ps. This value is of the same order as the 
period of fluctuations within each conformational state 
and, most importantly, this is approximately the dura- 
tion of the process of the trajectory passing from state 
to state. Therefore, the MSM has to be augmented in 
order to be able to describe the dynamics at significantly 
shorter time steps. 

For this purpose we build a variant of the hidden 
Markov model using the same configurational states of 
the peptide. Specifically, we use the 'e-machine' by 
Crutchfield et al [8, 9]. Instead of the conformational 
states, Si, themselves (A,B,C) we consider the /-long 
sequences of states V$ = {,Si_/ + i . . . Si-2Si-iSi}. The 
advantage of such description is that for a small time 
step, even if the original states are correlated over sev- 
eral steps, for long enough sequences V^, these new states 
(the sequences) are uncorrelated. We, therefore, can 
build a Markov model on these new states. 

The Markov property on the original states is fulfilled 
for At > 6ps. For At = bps the sequences of at least 
two time steps are required to build the hidden Markov 
model. The model is shown in Fig. 2. Here the states '0', 
'2', and '1' correspond to the conformational states A, 
£?, and C since they mostly consist of the sequences = 
A A, 2 = BB, and 1 = CC respectively. An additional 
state '2' describes the transition process from A to B. It 
consists of the sequence AB and has two main transitions 
from it. With the probability 0.643 the following symbol 
is B and the next state is 3 which means that the system 
is transferred to the conformational state B. There is, 
however, a significant probability of 0.351 for the next 
symbol to be A, which describes the return to state or 
the original conformation A. 

This analysis illustrates the advantage of the hid- 
den Markov model: it elucidates the mechanism of the 
A — >• B transition. It also explains where the non-Markov 
property comes from and gives the time scale limit at 
which different pathways of the conformational transi- 
tion start to differentiate from each other. 

Using the hidden Markov model the time step can be 
reduced to 0.3ps, the model is described in Fig. 3. The 
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FIG. 2: Hidden Markov states for the time step of 5ps; left: 
conformations corresponding to all time frames belonging to 
the hidden Markov states; right: the hidden Markov states 
and transitions between them, the labels on the arcs indicate 
the symbol following the state as the result of the transition 
and the transition probability 



transitions are the moments when the system enters state 
9 being before in any other state. The model predicts 
that the transitions last on average for « lps, Fig. 3, 
right. Thus, the 0.3ps precision in identifying them is 
satisfactory. This gives us a tool to investigate what 
happens at different moments before the transition. In 
particular, we can study the behaviour of water. For this 
we collect the time frames at specific times before the 
transitions, Fig. 4. 

We calculate the density of oxygen (hydrogen) atoms 
by averaging over the selected time frames. The obtained 
field p(x, t) gives the probability of finding water atoms 
at various locations x around the peptide at times t in 
advance of the transition. 

The transition from conformation A to conformation 
B corresponds to « 180° flip of the NH3 group (the right 
hand side of the molecule in Fig. 5). Both ends of the 
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FIG. 3: Left: the e-machine for the time step 0.3 ps, the length 
of the sequences is / = 4, state '0' corresponds to the original 
state 'A' (mostly consists of the sequences AAAA), similarly, 
state '9' corresponds to l B\ state '9' can only be reached from 
state '0' via the states '2', '7', '4', '8' that describe the mecha- 
nism (pathways) of the A — »• B transition; right: three typical 
cases during the transition: direct transition (top, probability 
0.61), transition with several recrossings (middle, probability 
0.04), failed attempt of transition (bottom, probability 0.07), 
the probabilities of these cases are given assuming that the 
probability of going from state '0' to state '9' by any possible 
route is 1 
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FIG. 4: Collecting the time frames for the 'time before tran- 
sition' statistics; the dots on the 'time' axes are the transition 
moments 

molecule posses charges, negative on the CO2 and pos- 
itive on the NH3 sites, which leads to relatively strong 
attachment of water molecules at the ends. Hydrogen 
bonded water molecules to the oxygens and to the hy- 
drogens form the dense areas of water corresponding to 
more rigid hydrogen bonds network, that is more stable 
structures. 

This is an intuitively clear result. More interesting is 
the moment just before the transition, Fig. 5, middle. 
Even though the overall structure of the dense areas re- 
mains similar to the A state, evidently, the total area 
of the dense water is significantly reduced. We quan- 
tify this decrease by measuring the volume of the dense 
areas depicted in Fig. 5. The result as the function of 
time before the transition is given in Fig. 6. During the 
time period when the rotation of the NH3 group is most 
significant, from w 9ps to ~ lps, the size of the dense 
areas of water is more than 2 times for oxygen and 3 
times for hydrogen smaller compared to those during the 
stable periods of conformation A. This corresponds to 
more diffuse character of the hydrogen bonded network 
of water molecules as they tend to appear at different, 




FIG. 5: Density isosurfaces at 0.7 (average: 0.5) of oxygen 
(red) and 0.1 ^ (average: 0.08) of hydrogen (blue) for states 
A (top), B (bottom), and « 3ps before the transition 
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FIG. 6: The size of the high density areas (the isosurface 
values are the same as in Fig. 5 for oxygen (red) and hydrogen 
(black)) during the transition from state A to state B (the 
points for state B are artificially shifted to the value of ~ 
0.0bps 



less concerted locations for different transitions. 

However, the most surprising effect following from this 
analysis is that the density of water starts reducing as 
early as ~ 50ps in advance of the transition. This is 
almost 10 times earlier than the actual conformational 
change of the peptide! 

The above analysis reveals the changes in water den- 
sities but it does not show which part of it is directly 
correlated with the changes in the peptide angles. We 
analyse the dynamical correlations between the dihedral 
angles and the water density using the Linear Stochastic 
Estimation (LSE) model (originally developed for visual- 
ising coherent structures in turbulent flows [10] and the 
identification of noise sources in turbulent jets [11]). 
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The density field p(x, t) is converted to a time fluctuat- 
ing field by subtracting the time average p(x): p'(x, t) = 
p(x, £) — p(x). In the LSE framework an approximation 
p(x, £) = a4>(t) + f3$(t) to //(x, t) is found that minimises 
the residual error (// — p) t [12], where ^(t) are the 
angles averaged over all the frames at time t and a, f3 
are constants. p(x, t) represents the density time fluc- 
tuations correlated with the angles. It is calculated by 
solving for a, f3 the system of linear equations, obtained 
from (p'(x.,t)<i>(t)) = ({ot<j)(t) + f3$(t)}(j)(t)} (and simi- 
larly for ip(t)) in the assumption of the zero mean of the 
uncorrelated part of the density fluctuations: 

a${t)4>{t)) + f3${t)4>{t)) = <p'(x, 
a(0(t)rP(t)) + PmWt)) = <p'(x,t)^(t)>. 

The density fluctuations field p'(x, t) and its part cor- 
related with the peptide's conformation p(x, t) are shown 
in Fig. 7 for several representative time moments. 

The water fluctuations (right column) are significantly 
stronger just before the transition (2Aps) compared to 
the stable period (~ lOOps). Very surprisingly the con- 
ditionally averaged water fluctuations (left column) are 
virtually uncorrelated with the peptide at all times ex- 
cept for the short period immediately before the tran- 
sition (some fluctuations are also noticeable as early as 
~ 20j98 before the transition, which agrees with the on- 
set of the density decrease in Fig. 6). Interestingly, at 
Ops, when the transition process is complete, the water 
density becomes uncorrelated with the peptide, similar 
to the stable periods. However, the fluctuations of it 
remain strong, only slightly weaker than at 2Aps. We 
explain this effect by the large inertia of the water shell, 
that needs relatively long time for the fluctuations to 
settle down. The fact that these post-transition water 
fluctuations are decoupled from the peptide emphasizes 
the discovered phenomenon of strong water-peptide in- 
teractions precisely during the transition process. 

Summarising, we have found that (i) w bps before the 
transition, when the dihedral angles change the most, 
the water density significantly reduces; (ii) the change 
of water density begins at ~ 50ps before the transition, 
10 times earlier than the changes in the angles (iii) dur- 
ing the transition the dynamics of water density becomes 



highly correlated with the dynamics of the angles; and 
(iv) these correlations are completely absent during the 
stable conformation periods. 

We conclude that water and the peptide behave as an 
integral dynamical system. During the conformational 
transition the peptide and the surrounding water un- 
dergo transitions together. This is in contrast to the 
metastable periods when their dynamics is essentially de- 
coupled. The transition is characterised by a more dif- 
fuse hydrogen bonds network of water. The changes in 
the peptide are substantially delayed in time. Thus, it is 
likely that water drives the whole process of conforma- 
tional transitions. 
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FIG. 7: The cross-section (the value of the z coordinate is 
chosen such that the cutting plane passes through the centre 
of mass of the peptide) of the density fluctuation field p'(x, t) 
(right), and its part correlated with the peptide's conforma- 
tion p(x, t) (left) for oxygen; the time before transition are, 
from top to bottom, 94.8ps, 21ps, 2.4ps, and Ops; the density 
scale is restricted to the —0.2 ... 0.2 interval for clarity, the 
maxima of the peaks reach the values of 0.78 and -0.42 



